Abstract

We construct and study an extended random matrix model of RNA (polymer)
folding. A perturbation which acts on all the nucleotides in the chain is added
to the action of the RNA partition function, and its effect on the partition
function and the genus distributions is studied. This perturbation distinguishes
between the paired and unpaired bases. For example, for α = 1 (where α is the
ratio of the strengths of the original and perturbed terms in the action) the
partition function and genus distribution for odd lengths vanish completely. The
partition function and the genus distribution are non-zero for even lengths,
where only structures with fully paired bases remain. This implies that (i) the
genus distributions are different and (ii) there is a "structural transition"
(from an "unpaired-paired base phase" to a "completely paired base phase") as α
approaches 1 in the extended matrix models. We compare the results of the
extended RNA model with the results of G. Vernizzi, H. Orland and A. Zee in
PRL 94, 168103 (2005).
1 Introduction



RNA (ribonucleic acid) is the only known biomolecule which plays the dual role
of being a carrier of genetic information and an enzyme in important biological
reactions [2]. RNA carries the genetic information from DNA to proteins (the
transfer of information from DNA to RNA is referred to as transcription, and
from RNA to proteins as translation). Over the years, the discovery of enzymatic
and other functional roles of ribozymes (larger RNAs, ~3000 bases long) has
strengthened the importance of RNA in cellular functions. The study of RNA
structure, the classification of its different levels of structure, the
complexity at each level, and the energetics and stability have therefore
captured the interest of scientists from many fields.

Structurally, bio-polymers have been classified into three hierarchical levels,
graded in increasing order of complexity and instability:

(i) Primary structure: the chemical sequence, i.e. the sequence of nucleotides
(planar diagrams); (ii) Secondary structure: the local short-range pairing of
the complementary Watson-Crick bases A-U and G-C via hydrogen bonding (planar
diagrams); and (iii) Tertiary structure: the spatial arrangement of secondary
units by means of several van der Waals contacts, hydrogen bonding between the
complementary base pairs and interactions in which loops and bulges can
themselves partially pair, leading to the formation of pseudoknots (non-planar
diagrams). Pseudoknots are conformations whose associated disk diagrams are not
planar; in other words, pseudoknots are false knots with higher genus (the genus
may be understood as the number of handles that a surface must have so that the
arcs, representing pairings between the bases, do not intersect), and they are
much more complex than the previous two levels of structure. Most of the work in
the field of RNA folding has been carried out on secondary structures. Monte
Carlo techniques and numerous algorithms are available to solve problems related
to secondary structures [3][4][5][6]. Very little, though, is known about the
tertiary structure of RNA because of the complexity associated with these
structures. Immense energies are required to fold RNA into its folded
conformations; the energies involved are of the order of ~100 kcal/mol or more
[7][5]. In RNA, the complexity increases manifold when one goes from secondary
to tertiary structures. Folding of the secondary structures to form tertiary
structures is largely governed by the presence of Mg²⁺ ions, as demonstrated by
experimental studies. The biological function of RNA is governed by these folded
structures, i.e. the tertiary structures. So, in order to understand the role
that RNA plays in biological processes, it becomes essential to understand its
folded conformations.

Matrix models have proved to be useful and important in areas such as disordered
condensed matter systems and quantum chromodynamics, among others. These models
have been extremely useful in modeling very complex systems. A random matrix
model has been proposed in [1][9] to count and classify all possible structures
of RNA which can exist for a given length of the nucleotide chain. However, the
structures that have been discovered are a very small subset of this vast number
of structures. This may be due to the constantly varying conditions inside a
living cell. The environment of a cell is subject to changing conditions of
temperature, salinity, acidity etc., which act as perturbations to the processes
taking place in the cell. These perturbations affect the overall functioning of
the cell. It is therefore important to add a perturbation to the potential of
the original model of RNA [1][9] and to observe its effect. In this work a very
simple perturbation is studied which does not drastically change the essence of
the model proposed in [1][9]. However, it changes the enumeration of structures
of RNA. In particular, for a certain strength of this perturbation a phase of
RNA structures with no unpaired bases is found. This results in a limited
activity of the RNA, due to the paired conformations alone. Thus a seemingly
trivial change in the mathematical structure of the RNA model produces
structures of RNA which lead to a non-trivial change in their biological
activity.

The random matrix model of RNA folding proposed in [9] is a theoretical model
which focuses on the topological aspect of RNA folding and is constructed as an
(N×N) matrix field theory. In the model, a matrix V_ij is considered which
accounts for the attractive energies between the nucleotides, disregards any
self-interaction of the nucleotides, maintains finite flexibility of the
nucleotide chain and encodes the fact that only nucleotides which are at least
4 nucleotides apart can interact [9]. The model makes use of the notation of
quantum chromodynamics in terms of Feynman diagrams. In the large N limit,
matrix models can be expanded in powers of 1/N², which also gives a topological
expansion, as was originally observed by G. 't Hooft [10]. This allows us to
identify the tertiary structures as terms with powers of 1/N². The RNA matrix
model in [1] enumerates all the secondary structures with pseudoknots by making
the following simplifications: the polymer chain is assumed to be infinitely
flexible, so that nucleotides less than 4 nucleotides apart can also interact;
steric constraints are neglected, so that any base pairing is allowed
irrespective of the type of nucleotide; and the interaction between the bases in
the chain is assumed to be simple, with all the base pairings taking place with
the same strength v. This means that the interaction matrix V_ij (which is an
(L×L) matrix, L being the length of the polymer chain) has all its elements
equal to v.

In this paper, section 2 introduces the extended matrix model, where a
perturbation is added to all the bases in the nucleotide chain, keeping the same
simplifications as in [1]. A parameter α is found which can take different
values and hence distinguishes between the different generalized models. A
general form of the partition function for the extended matrix model is found
for any length L and α. We also discuss the extended matrix model with α = 1. In
section 3 we discuss the diagrammatic representation of the extended matrix
model corresponding to any α. Section 4 deals with the genus distributions
($\mathcal{N}_\alpha$ and $a_{L,g,\alpha}$) of the extended matrix model, where
the partition function $Z_{L,\alpha}(N)$ vanishes for odd lengths as α
approaches 1. We will refer to the RNA folding model in [1][9] as the RNA-MM and
to the extended matrix model as the RNA-EMM from now on.



2 Extended Matrix Model of RNA (RNA-EMM) with a Perturbation on all the Bases



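We propose an extension of the RNA-MM discussed in [1]. Here we add a
perturbation term to the action of the partition function of the nucleotide
chain. The partition function of the extended matrix model is

$$Z_{L,\alpha}(N) = \frac{1}{A_L(N)} \int \prod_{i=1}^{L} d\phi_i \, \exp\Big(-\frac{N}{2}\sum_{i,j=1}^{L} (V^{-1})_{ij}\,\mathrm{Tr}\,\phi_i\phi_j\Big) \exp\Big(-N\sum_{i=1}^{L} (W^{-1})_{i}\,\mathrm{Tr}\,\phi_i\Big) \frac{1}{N}\,\mathrm{Tr}\prod_{i=1}^{L}(1+\phi_i), \qquad (1)$$

where $\phi_i$ are L independent (N×N) Hermitian matrices, $V_{ij}$ is an (L×L)
interaction matrix containing the interactions between the nucleotides, L being
the length of the nucleotide chain, $\prod_i (1+\phi_i)$ is an ordered product
over the $\phi_i$'s and $A_L(N)$ is the normalization constant given by

$$A_L(N) = \int \prod_{i=1}^{L} d\phi_i \, \exp\Big(-\frac{N}{2}\sum_{i,j=1}^{L} (V^{-1})_{ij}\,\mathrm{Tr}\,\phi_i\phi_j\Big) \exp\Big(-N\sum_{i=1}^{L} (W^{-1})_{i}\,\mathrm{Tr}\,\phi_i\Big). \qquad (2)$$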

These RNA matrix models (both RNA-MM and RNA-EMM) are variants of Gaussian
Penner models. The addition of a linear term to this action is non-trivial, as
shown in references [11][12], where a linear term is added to the potential
V(Φ) of the partition function for the multi-cut matrix model.

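The perturbation term in the partition function of equation (1) is

$$\exp\Big(-N\sum_{i=1}^{L} (W^{-1})_{i}\,\mathrm{Tr}\,\phi_i\Big), \qquad (3)$$

where $W_i = w$, w being the strength of the perturbation. We consider
$V_{ij} = v$. Completing the square, the normalization constant $A_L(N)$ can be
written as a φ-independent exponential prefactor times the Gaussian integral
$\int \prod_{i=1}^{L} d\Phi_i \exp\big(-\frac{N}{2}\sum_{i,j}(V^{-1})_{ij}\,\mathrm{Tr}\,\Phi_i\Phi_j\big)$,
where $\Phi_i = (\phi_i + V_{ij}W_j^{-1})$. Carrying out a series of
Hubbard-Stratonovich transformations, the integral in equation (1) is reduced
to an integral over a single (N×N) matrix σ,

$$Z_{L,\alpha}(N) = \frac{1}{R_L(N)} \int d\sigma \, \exp\Big(-\frac{N}{2v}\,\mathrm{Tr}\,\sigma^2 - \frac{N}{w}\,\mathrm{Tr}\,\sigma\Big) \frac{1}{N}\,\mathrm{Tr}\,(1+\sigma)^L, \qquad (4)$$

where $R_L(N) = \int d\sigma \, \exp\big(-\frac{N}{2v}\,\mathrm{Tr}\,\sigma^2 - \frac{N}{w}\,\mathrm{Tr}\,\sigma\big)$.
We make a redefinition such that $\sigma' = (\sigma + \frac{v}{w}) = (\sigma + \alpha)$,
where we define $\alpha = \frac{v}{w}$, the ratio of the strength of interaction
between the nucleotides to the strength of the perturbation. The constant
generated by the shift appears in both the numerator and $R_L(N)$ and cancels.
We therefore get from equation (4),

$$Z_{L,\alpha}(N) = \frac{1}{R_L(N)} \int d\sigma' \, \exp\Big(-\frac{N}{2v}\,\mathrm{Tr}\,\sigma'^2\Big) \frac{1}{N}\,\mathrm{Tr}\,(1+\sigma'-\alpha)^L. \qquad (5)$$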
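The shift from equation (4) to equation (5) can be checked by completing the square. The short derivation below is a sketch that assumes the action in equation (4) is $-\frac{N}{2v}\mathrm{Tr}\,\sigma^2 - \frac{N}{w}\mathrm{Tr}\,\sigma$, which is the form required by the stated redefinition σ' = σ + v/w; it uses Tr 1 = N for an (N×N) matrix.

```latex
\begin{align*}
-\frac{N}{2v}\,\mathrm{Tr}\,\sigma^{2} - \frac{N}{w}\,\mathrm{Tr}\,\sigma
  &= -\frac{N}{2v}\,\mathrm{Tr}\Big(\sigma + \frac{v}{w}\Big)^{2} + \frac{N^{2} v}{2 w^{2}} \\
  &= -\frac{N}{2v}\,\mathrm{Tr}\,\sigma'^{2} + \frac{N^{2}\alpha^{2}}{2v},
  \qquad \sigma' = \sigma + \alpha, \quad \alpha = \frac{v}{w}.
\end{align*}
```

The last term does not depend on σ', so it is the same in the numerator and in the normalization and drops out of the ratio, while $(1+\sigma)^L$ becomes $(1+\sigma'-\alpha)^L$.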

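We now introduce the spectral density $\rho_{N,\alpha}(\lambda)$ of the Gaussian
matrix model defined at finite N as

$$\rho_{N,\alpha}(\lambda) = \frac{1}{R_L(N)} \int d\sigma' \, \exp\Big(-\frac{N}{2v}\,\mathrm{Tr}\,\sigma'^2\Big) \frac{1}{N}\,\mathrm{Tr}\,\delta(\lambda - \sigma'). \qquad (6)$$

Making use of the identity $\int_{-\infty}^{+\infty} d\lambda \, \rho_{N,\alpha}(\lambda) = 1$
in equation (5), the partition function can be written as

$$Z_{L,\alpha}(N) = \int_{-\infty}^{+\infty} d\lambda \, \rho_{N,\alpha}(\lambda)\,(1+\lambda-\alpha)^L. \qquad (7)$$

Defining $G(t,N,\alpha)$ as the exponential generating function of
$Z_{L,\alpha}(N)$, we can write

$$G(t,N,\alpha) = \sum_{L=0}^{\infty} Z_{L,\alpha}(N)\,\frac{t^L}{L!} = \int_{-\infty}^{+\infty} d\lambda \, \rho_{N,\alpha}(\lambda) \sum_{L=0}^{\infty} \frac{t^L (1+\lambda-\alpha)^L}{L!}, \qquad (8)$$

$$G(t,N,\alpha) = \int_{-\infty}^{+\infty} d\lambda \, \rho_{N,\alpha}(\lambda)\, \exp\big(t(1+\lambda-\alpha)\big). \qquad (9)$$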

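We use the finite-N form of the spectral density $\rho_{N,\alpha}(\lambda)$,

$$\rho_{N,\alpha}(\lambda) = \frac{e^{-N\lambda^2/2v}}{\sqrt{2\pi v N}} \sum_{k=0}^{N-1} \binom{N}{k+1} \frac{H_{2k}\big(\lambda\sqrt{N/2v}\big)}{2^k\,k!}, \qquad (10)$$

where $H_{2k}\big(\lambda\sqrt{N/2v}\big)$ represents Hermite polynomials. Using
equation (10), the generating function $G(t,N,\alpha)$ in equation (9) can be
written as

$$G(t,N,\alpha) = \frac{1}{\sqrt{2\pi v N}} \sum_{k=0}^{N-1} \binom{N}{k+1} \frac{1}{2^k\,k!} \int_{-\infty}^{+\infty} d\lambda \, \exp\Big(-\frac{N\lambda^2}{2v}\Big)\, H_{2k}\big(\lambda\sqrt{N/2v}\big)\, \exp\big(t(\lambda+1-\alpha)\big). \qquad (11)$$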
Table 1

The table lists the partition functions $Z_{L,\alpha}(N)$ for α = (0.25, 0.5)
for different lengths L.

| L | $Z_{L,\alpha}(N)$ [α = 0.25] | $Z_{L,\alpha}(N)$ [α = 0.5] |
|---|---|---|
| 1 | $3/4$ | $1/2$ |
| 2 | $v + 9/16$ | $v + 1/4$ |
| 3 | $9v/4 + 27/64$ | $3v/2 + 1/8$ |
| 4 | $81/256 + 27v/8 + 2v^2 + v^2/N^2$ | $1/16 + 3v/2 + 2v^2 + v^2/N^2$ |
| 5 | $243/1024 + 135v/32 + 15v^2/2 + 15v^2/4N^2$ | $1/32 + 5v/4 + 5v^2 + 5v^2/2N^2$ |
| 6 | $729/4096 + 1215v/256 + 135v^2/8 + 135v^2/16N^2 + 5v^3 + 10v^3/N^2$ | $1/64 + 15v/16 + 15v^2/2 + 15v^2/4N^2 + 5v^3 + 10v^3/N^2$ |
| 7 | $2187/16384 + 5103v/1024 + 945v^2/32 + 945v^2/64N^2 + 105v^3/4 + 105v^3/2N^2$ | $1/128 + 21v/32 + 35v^2/4 + 35v^2/8N^2 + 35v^3/2 + 35v^3/N^2$ |
| 8 | $6561/65536 + 5103v/1024 + 2835v^2/64 + 2835v^2/128N^2 + 315v^3/4 + 315v^3/2N^2 + 14v^4 + 21v^4/N^4 + 70v^4/N^2$ | $1/256 + 7v/16 + 35v^2/4 + 35v^2/8N^2 + 35v^3 + 70v^3/N^2 + 14v^4 + 21v^4/N^4 + 70v^4/N^2$ |



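Rewriting the above equation,

$$G(t,N,\alpha) = \exp\Big(\frac{vt^2}{2N} + t(1-\alpha)\Big) \frac{1}{\sqrt{2\pi v N}} \sum_{k=0}^{N-1} \binom{N}{k+1} \frac{1}{2^k\,k!} \int_{-\infty}^{+\infty} d\lambda \, \exp\Big(-\frac{N}{2v}\Big(\lambda - \frac{vt}{N}\Big)^2\Big)\, H_{2k}\big(\lambda\sqrt{N/2v}\big). \qquad (12)$$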

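Using a standard result of integration over Hermite polynomials [15],

$$\int_{-\infty}^{+\infty} dx \, \exp\big(-(x-y)^2\big)\, H_n(x) = \sqrt{\pi}\, y^n\, 2^n, \qquad (13)$$

equation (12) solves to

$$G(t,N,\alpha) = \exp\Big(\frac{vt^2}{2N} + t(1-\alpha)\Big) \frac{1}{N} \sum_{k=0}^{N-1} \binom{N}{k+1} \frac{(vt^2)^k}{k!\,N^k}. \qquad (14)$$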
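The Hermite integral in equation (13) can be checked exactly for small n. The sketch below is not from the paper: it assumes the physicists' convention for $H_n$ (recurrence $H_{k+1} = 2xH_k - 2kH_{k-1}$), substitutes x = y + u, and integrates term by term using the Gaussian moments of $e^{-u^2}$. The function names are illustrative.

```python
from fractions import Fraction
from math import comb

def hermite(n):
    """Coefficients (lowest power first) of the physicists' Hermite polynomial H_n."""
    h_prev, h = [Fraction(1)], [Fraction(0), Fraction(2)]
    if n == 0:
        return h_prev
    for k in range(1, n):
        # H_{k+1}(x) = 2x H_k(x) - 2k H_{k-1}(x)
        nxt = [Fraction(0)] + [2 * c for c in h]
        for i, c in enumerate(h_prev):
            nxt[i] -= 2 * k * c
        h_prev, h = h, nxt
    return h

def gauss_moment(m):
    """(1/sqrt(pi)) * integral of u^m exp(-u^2) over the real line."""
    if m % 2:
        return Fraction(0)
    out = Fraction(1)
    for j in range(1, m, 2):      # builds (m-1)!! / 2^(m/2)
        out *= Fraction(j, 2)
    return out

def shifted_integral(n):
    """Coefficients in y of (1/sqrt(pi)) * int dx exp(-(x-y)^2) H_n(x)."""
    coeffs = [Fraction(0)] * (n + 1)
    for p, c in enumerate(hermite(n)):      # term c * x^p with x = y + u
        for j in range(p + 1):              # (y+u)^p = sum_j C(p,j) y^j u^(p-j)
            coeffs[j] += c * comb(p, j) * gauss_moment(p - j)
    return coeffs

# Equation (13) predicts the single term 2^n y^n.
for n in range(6):
    print(n, shifted_integral(n))
```

For each n the only non-zero coefficient is that of $y^n$, which is the statement of equation (13) after dividing by $\sqrt{\pi}$.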
Table 2

The table lists the partition functions $Z_{L,\alpha}(N)$ for any α and for
different lengths L.

| L | $Z_{L,\alpha}(N)$ [RNA-EMM] |
|---|---|
| 1 | $1-\alpha$ |
| 2 | $1+v+\alpha^2-2\alpha$ |
| 3 | $1+3v-\alpha^3+3\alpha^2-3\alpha-3v\alpha$ |
| 4 | $1+6v+2v^2+v^2/N^2+\alpha^4-4\alpha^3+6\alpha^2+6v\alpha^2-4\alpha-12v\alpha$ |
| 5 | $1-\alpha^5+5\alpha^4-10\alpha^3+10\alpha^2-5\alpha+10v+10v^2+5v^2/N^2-10v\alpha^3+30v\alpha^2-30v\alpha-10v^2\alpha-5v^2\alpha/N^2$ |
| 6 | $1+\alpha^6-6\alpha^5+15\alpha^4-20\alpha^3+15\alpha^2-6\alpha+15v+30v^2+15v^2/N^2+5v^3+10v^3/N^2+15v\alpha^4-60v\alpha^3+90v\alpha^2+30v^2\alpha^2+15v^2\alpha^2/N^2-60v\alpha-60v^2\alpha-30v^2\alpha/N^2$ |
| 7 | $1-\alpha^7+7\alpha^6-21\alpha^5+35\alpha^4-35\alpha^3+21\alpha^2-7\alpha+21v+70v^2+35v^2/N^2+35v^3+70v^3/N^2-21v\alpha^5+105v\alpha^4-210v\alpha^3-70v^2\alpha^3-35v^2\alpha^3/N^2+210v\alpha^2+210v^2\alpha^2+105v^2\alpha^2/N^2-105v\alpha-210v^2\alpha-105v^2\alpha/N^2-35v^3\alpha-70v^3\alpha/N^2$ |



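Combining equations (8) and (14) we have,

$$G(t,N,\alpha) = \sum_{L=0}^{\infty} Z_{L,\alpha}(N)\,\frac{t^L}{L!} = \exp\Big(\frac{vt^2}{2N} + t(1-\alpha)\Big) \frac{1}{N} \sum_{k=0}^{N-1} \binom{N}{k+1} \frac{(vt^2)^k}{k!\,N^k}. \qquad (15)$$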



We observe from equation (15) that by taking different values of α (i.e. by
varying the ratio of the strengths of the interaction between the bases and the
perturbation in the action of the partition function) we can have several
extensions of the same model. For instance, for α = 0, we have the RNA-MM [1].
We substitute α = 0.25, 0.5, 0.75 in equation (15) and find the corresponding
partition functions (α = 0.25, 0.5 are listed in Table 1). For completeness we
write the explicit dependence of the partition function $Z_{L,\alpha}(N)$ on N
in terms of $1/N^2$ (the topological parameter) and the genus g, as in [1]:
$Z_{L,\alpha}(N) = \sum_{g=0}^{\infty} a_{L,g,\alpha}\,\frac{1}{N^{2g}}$.
Here, the coefficients $a_{L,g,\alpha}$ give the number of diagrams (structures)
at a given length L, genus g and α. We also define the total number of diagrams
as $\mathcal{N}_\alpha$, for a particular length and α, irrespective of the
genus.
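The partition functions can be read off the generating function by exact series expansion. The sketch below is an illustration, not the authors' code. It assumes the generating function has the form $\exp(vt^2/2N + t(1-\alpha))\,\frac{1}{N}\sum_k \binom{N}{k+1}(vt^2)^k/(k!\,N^k)$ and uses the rewriting $\binom{N}{k+1}/N^{k+1} = \prod_{j=1}^{k}(1 - j/N)/(k+1)!$, so that every coefficient becomes a polynomial in x = 1/N. The names `paired_coeffs` and `partition_function` are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest power first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def paired_coeffs(n):
    """Polynomial in x = 1/N multiplying v^n t^(2n)/(2n)! in G at alpha = 1."""
    total = [Fraction(0)] * (n + 1)
    for k in range(n + 1):
        m = n - k
        # prod_{j=1..k} (1 - j x) / (k! (k+1)!) comes from C(N, k+1) / (N^(k+1) k!)
        term = [Fraction(1, factorial(k) * factorial(k + 1))]
        for j in range(1, k + 1):
            term = poly_mul(term, [Fraction(1), Fraction(-j)])
        # (x/2)^m / m! comes from the factor exp(v t^2 / 2N)
        shift = [Fraction(0)] * m + [Fraction(1, 2 ** m * factorial(m))]
        for i, c in enumerate(poly_mul(term, shift)):
            total[i] += c
    return [factorial(2 * n) * c for c in total]

def partition_function(L, alpha):
    """Return {(n, g): coeff} such that Z = sum of coeff * v^n / N^(2g)."""
    alpha = Fraction(alpha)
    Z = {}
    for n in range(L // 2 + 1):
        # exp(t(1 - alpha)) contributes (1 - alpha)^(L - 2n) per choice of 2n paired bases
        weight = comb(L, 2 * n) * (1 - alpha) ** (L - 2 * n)
        for power, c in enumerate(paired_coeffs(n)):
            if c * weight != 0:
                Z[(n, power // 2)] = c * weight   # only even powers of 1/N survive
    return Z

print(partition_function(4, Fraction(1, 4)))
print(partition_function(5, 1))
```

With α = 1/4 and L = 4 this gives the coefficients 81/256, 27/8, 2 and 1 for the terms $1$, $v$, $v^2$ and $v^2/N^2$, and at α = 1 every odd length returns an empty result.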

The General Form of $Z_{L,\alpha}(N)$: The general form of the partition
function for the extended matrix model can be obtained from equation (1) after
completing the square in the exponent of the numerator and in the normalization
constant $A_L(N)$ in terms of the variable Φ; the result is equation (16).
Table 3

The table lists the partition functions $Z_{L,\alpha}(N)$ for different lengths
L of the nucleotide chain for α = 0 (RNA-MM) and α = 1 (RNA-EMM).

| L | $Z_{L,\alpha=0}(N)$ [RNA-MM] | $Z_{L,\alpha=1}(N)$ [RNA-EMM] |
|---|---|---|
| 1 | $1$ | $0$ |
| 2 | $1+v$ | $v$ |
| 3 | $1+3v$ | $0$ |
| 4 | $1+6v+2v^2+v^2/N^2$ | $2v^2+v^2/N^2$ |
| 5 | $1+10v+10v^2+5v^2/N^2$ | $0$ |
| 6 | $1+15v+30v^2+5v^3+(15v^2+10v^3)/N^2$ | $5v^3+10v^3/N^2$ |
| 7 | $1+21v+70v^2+35v^3+(35v^2+70v^3)/N^2$ | $0$ |
| 8 | $1+28v+140v^2+140v^3+14v^4+21v^4/N^4+(70v^2+280v^3+70v^4)/N^2$ | $14v^4+70v^4/N^2+21v^4/N^4$ |



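$$Z_{L,\alpha}(N) = \frac{1}{A_L(N)} \int \prod_{i=1}^{L} d\Phi_i \, \exp\Big(-\frac{N}{2}\sum_{i,j=1}^{L} (V^{-1})_{ij}\,\mathrm{Tr}\,\Phi_i\Phi_j\Big) \frac{1}{N}\,\mathrm{Tr}\prod_{i=1}^{L}\big(1+\Phi_i - V_{ij}W_j^{-1}\big), \qquad (16)$$

where $V_{ij}W_j^{-1} = v/w = \alpha$ for $V_{ij} = v$ and $W_j = w$. The general
form of $Z_{L,\alpha}(N)$ for the extended matrix model from equation (16) via
Wick's theorem is

$$Z_{L,\alpha}(N) = (1-\alpha)^L + (1-\alpha)^{L-2}\sum_{i<j} V_{ij} + (1-\alpha)^{L-4}\sum_{i<j<k<l} V_{ij}V_{kl} + (1-\alpha)^{L-4}\sum_{i<j<k<l} V_{il}V_{jk} + \Big(\frac{1}{N}\Big)^2 (1-\alpha)^{L-4}\sum_{i<j<k<l} V_{ik}V_{jl} + \ldots \qquad (17)$$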



We observe that the general form of $Z_{L,\alpha}(N)$ has the same structure as
in [1], with an additional multiplicative factor of a power of (1 − α) on each
term (Table 2 lists $Z_{L,\alpha}(N)$ up to L = 7 in terms of α). The partition
functions can be obtained from equation (17) by substituting the desired α for a
particular length.
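The Wick expansion can be illustrated by brute-force enumeration of arc diagrams. The sketch below is an illustration under stated assumptions, not the paper's method. It takes $V_{ij} = v$ for every pair (so any two bases may pair), lists every partial pairing of L bases, and assigns the genus from the number of boundary cycles F of the closed diagram through g = (n + 1 − F)/2 for n arcs, a standard Euler-characteristic count that is assumed here. All function names are hypothetical.

```python
from collections import Counter

def matchings(points):
    """Yield every partial pairing (list of arcs) of the ordered tuple `points`."""
    if not points:
        yield []
        return
    first, rest = points[0], points[1:]
    for m in matchings(rest):                 # `first` stays unpaired
        yield m
    for idx, partner in enumerate(rest):      # `first` pairs with `partner`
        remaining = rest[:idx] + rest[idx + 1:]
        for m in matchings(remaining):
            yield [(first, partner)] + m

def genus(arcs):
    """Genus of an arc diagram; unpaired bases do not change the topology."""
    n = len(arcs)
    if n == 0:
        return 0
    ends = sorted(p for arc in arcs for p in arc)
    pos = {p: i for i, p in enumerate(ends)}  # relabel paired bases 0..2n-1
    tau = {}
    for a, b in arcs:
        tau[pos[a]], tau[pos[b]] = pos[b], pos[a]
    seen, faces = set(), 0
    for start in range(2 * n):
        if start in seen:
            continue
        faces += 1
        i = start
        while i not in seen:
            seen.add(i)
            i = (tau[i] + 1) % (2 * n)        # follow the arc, then step along the backbone
    return (n + 1 - faces) // 2

def diagram_counts(L):
    """Counter keyed by (unpaired bases, arcs, genus) for a chain of length L."""
    counts = Counter()
    for arcs in matchings(tuple(range(L))):
        counts[(L - 2 * len(arcs), len(arcs), genus(arcs))] += 1
    return counts

for (unpaired, arcs, g), count in sorted(diagram_counts(4).items(), reverse=True):
    print(f"{count} x (1-alpha)^{unpaired} v^{arcs} / N^{2 * g}")
```

Each key (u, n, g) corresponds to a term $(1-\alpha)^u v^n / N^{2g}$, so for L = 4 the counts 1, 6, 2 and 1 reproduce $(1-\alpha)^4 + 6v(1-\alpha)^2 + 2v^2 + v^2/N^2$. The enumeration grows quickly with L and is meant only for short chains.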

Extended Model With α = 1: We now substitute α = 1 in equation (15), i.e. the
perturbation acts on all the bases in the nucleotide chain with the same
strength as the interaction between the bases. We get

$$G(t,N,\alpha=1) = \sum_{L=0}^{\infty} Z_{L,\alpha=1}(N)\,\frac{t^L}{L!} = \exp\Big(\frac{vt^2}{2N}\Big) \frac{1}{N} \sum_{k=0}^{N-1} \binom{N}{k+1} \frac{(vt^2)^k}{k!\,N^k}. \qquad (18)$$

We can see from equation (18) that only even-length partition functions are
non-zero, whereas for odd lengths the partition function vanishes (results are
summarized in Table 3). It will be shown in the next section that
$Z_{L,\alpha=1}(N) = 0$ for odd lengths implies that no unpaired base is allowed
in the α = 1 phase. This suggests a considerable change in the genus
distributions for the extended matrix model, especially for the case α = 1. This
is investigated in detail in section 4.
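As a worked check, the α = 1 generating function can be expanded in powers of t. The expansion below is a sketch obtained from the form $\exp(vt^2/2N)\,\frac{1}{N}\sum_k \binom{N}{k+1}(vt^2)^k/(k!\,N^k)$, which is assumed to be the content of equation (18). Only $t^2$ appears, so no odd power of t can be generated.

```latex
\begin{align*}
G(t,N,\alpha=1)
  &= 1 + v\,\frac{t^{2}}{2!}
     + \Big(2v^{2} + \frac{v^{2}}{N^{2}}\Big)\frac{t^{4}}{4!}
     + \Big(5v^{3} + \frac{10\,v^{3}}{N^{2}}\Big)\frac{t^{6}}{6!}
     + O(t^{8}), \\
Z_{1,\alpha=1}(N) &= Z_{3,\alpha=1}(N) = Z_{5,\alpha=1}(N) = 0 .
\end{align*}
```

The coefficients of $t^2/2!$, $t^4/4!$ and $t^6/6!$ are the fully paired parts of the α = 0 partition functions for L = 2, 4 and 6.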



3 Diagrammatic Representation of the Extended Model 



We explain in this section the physical interpretation of the general form of
$Z_{L,\alpha}(N)$ obtained in section 2 for the extended matrix model. The
general form of the partition function for the extended model is given by
equation (17). Each term in the partition function is accompanied by a factor of
a power of (1 − α). The partition function corresponding to L = 4 (Table 2) is
given by
$Z_{L=4,\alpha}(N) = 1 + 6v + 2v^2 + v^2/N^2 + \alpha^4 - 4\alpha^3 + 6\alpha^2 + 6v\alpha^2 - 4\alpha - 12v\alpha$,
or simply
$Z_{L=4,\alpha}(N) = (1-\alpha)^4 + 6v(1-\alpha)^2 + 2v^2 + v^2/N^2$.
This represents a total of 10 diagrams (Figure 1). In the diagrammatic
representation of these models (using the language of quantum chromodynamics),
the RNA backbone is represented by a solid line (representing the quark
propagator) and the hydrogen bonds between different nucleotides by dotted arcs
(representing the gluon propagators) joining two nucleotides on the solid line.
Therefore, a primary structure is shown with the nucleotides (the dots) on the
solid line, a secondary structure is represented by arcs (dotted lines) drawn
between nucleotides (dots) on the solid line (provided the arcs do not
intersect), and tertiary structures are those structures in which the arcs
intersect [9]. In these diagrams, the power of v gives the number of arcs in the
corresponding Feynman diagrams (the number of diagrams is given by the
coefficient of the corresponding power of v). Terms without a factor of $1/N^2$
correspond to planar diagrams, while terms with powers of $1/N^2$ represent
tertiary structures with non-zero genus. The first term in the partition
function for L = 4 is a planar term with each unpaired base associated with a
factor of (1 − α); the second term represents 6 diagrams with one arc each, with
each unpaired base (2 in number) accompanied by (1 − α); the third term
corresponds to two diagrams with two arcs each and no unpaired bases at all; and
the last term represents one diagram with two crossing arcs, i.e. a tertiary
term with genus one and no unpaired bases. These diagrams are shown in Figure 1
with their respective weights. The power of (1 − α) gives the number of unpaired
bases in the diagram. However, for α = 1 the partition function for L = 4 is
given by $Z_{L=4,\alpha=1}(N) = 2v^2 + v^2/N^2$, i.e. a total of 3 diagrams (in
Figure 1, the diagrams corresponding to $v^2$, with g = 0 and g = 1). For this
value of α, only those diagrams with completely paired bases remain. The
diagrams with even a single unpaired base are completely suppressed.

Fig. 1. The figure shows all the possible diagrams for L = 4 for the extended
matrix model. The paired bases are shown connected by arcs. Each unpaired base
is associated with a factor of (1 − α).

We thus observe that for α < 1, all kinds of structures, with fully unpaired,
partially paired-partially unpaired and completely paired bases, exist. However,
for α = 1, only structures with completely paired bases exist. We refer to these
structural differences between the two regions α < 1 and α = 1 as a "structural
transition" from an "unpaired-paired base phase" to a "completely paired base
phase" respectively. The unpaired-paired base phase comprises all the structures
enumerated by the RNA-MM in [1], with different weights associated with each
unpaired base for different α. The completely paired base phase consists of only
those structures which have no unpaired bases at all. For L = 4 and any given
α < 1, $a_{L=4,g=0} = 9$ and $\mathcal{N}_\alpha = 10$, so
$a_{L=4,g=0}/\mathcal{N}_\alpha = 0.9$. This value of the normalized
distribution can be seen in Figure 4(a) (i.e. corresponding to g = 0). For L = 4
and α = 1, $a_{L=4,g=0} = 2$ and $\mathcal{N}_\alpha = 3$, so
$a_{L=4,g=0}/\mathcal{N}_\alpha = 2/3 \approx 0.67$. This value of the
normalized distribution can also be seen in Figure 4(a).
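The normalized counts quoted above can be reproduced for any length without listing diagrams. The sketch below swaps in a technique that is not used in the paper: the Harer-Zagier recurrence $(n+1)\,\varepsilon_g(n) = 2(2n-1)\,\varepsilon_g(n-1) + (2n-1)(n-1)(2n-3)\,\varepsilon_{g-1}(n-2)$ for the number $\varepsilon_g(n)$ of fully paired diagrams with n arcs and genus g. Diagrams are counted without the (1 − α) weights, as in the L = 4 example, and the function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def chord_diagram_counts(n_max):
    """eps[n][g] = number of diagrams with n arcs, no unpaired base, and genus g."""
    eps = [[1]]                                   # n = 0: the empty diagram, genus 0
    for n in range(1, n_max + 1):
        row = []
        for g in range(n // 2 + 1):
            prev = eps[n - 1][g] if g < len(eps[n - 1]) else 0
            prev2 = eps[n - 2][g - 1] if n >= 2 and g >= 1 else 0
            # Harer-Zagier recurrence (an assumption of this sketch, not from the paper)
            row.append((2 * (2 * n - 1) * prev
                        + (2 * n - 1) * (n - 1) * (2 * n - 3) * prev2) // (n + 1))
        eps.append(row)
    return eps

def genus_counts(L, fully_paired):
    """Diagrams of length L per genus; fully_paired=True is the alpha = 1 phase."""
    eps = chord_diagram_counts(L // 2)
    counts = [0] * (L // 4 + 1)
    if fully_paired and L % 2:
        return counts                             # odd lengths have no diagrams
    arc_numbers = [L // 2] if fully_paired else range(L // 2 + 1)
    for n in arc_numbers:
        for g, e in enumerate(eps[n]):
            counts[g] += comb(L, 2 * n) * e       # choose which 2n bases are paired
    return counts

def normalized(counts):
    """Divide each genus count by the total number of diagrams."""
    total = sum(counts)
    return [Fraction(c, total) for c in counts] if total else []

print(normalized(genus_counts(4, fully_paired=False)))   # 9/10 at g = 0
print(normalized(genus_counts(4, fully_paired=True)))    # 2/3 at g = 0
```

The same functions run for the longer chains discussed in section 4, for example `genus_counts(40, fully_paired=True)`.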

We have seen that this perturbation has created a phase where RNA struc- 
tures with a limited biological activity (as only structures with paired bases 
are possible) are separated out from the otherwise possible vast number of 
structures. 
Fig. 2. (a) For α = 1, the normalized number of diagrams,
$a_{L,g,\alpha}/\mathcal{N}_\alpha$, is plotted with respect to the nucleotide
chain length L, keeping the genus g fixed. Here, $a_{L,g,\alpha}$ is the total
number of diagrams at a given L, g and α, and $\mathcal{N}_\alpha$ is the total
number of diagrams at a particular L and α, irrespective of g. (b) For α = 1,
the normalized number of diagrams $a_{L,g,\alpha}/\mathcal{N}_\alpha$ is plotted
as a function of genus. The length L of the chain is kept fixed in order to
obtain the genus distribution patterns for a specific length. Lengths considered
are L = 40, 80, 120, 160. The axis label in the figures should be understood as
$a_{L,g,\alpha}/\mathcal{N}_\alpha$.



4 Genus Distributions for the Extended Matrix Models 



We analyze the genus distributions for the extended matrix model and compare
them with the distributions of the RNA-MM [1]. We consider in particular the
extended matrix model with α = 1. For α = 1 the genus distributions are shown in
Figure 2. Figure 2(a) plots the normalized number of diagrams
$a_{L,g,\alpha=1}/\mathcal{N}_\alpha$ versus the length L at a fixed genus g.
Figure 2(b) shows a plot of the normalized number of diagrams
$a_{L,g,\alpha=1}/\mathcal{N}_\alpha$ versus g at a particular length L. As
indicated by the generating function in equation (18), we find that the
partition function for odd lengths vanishes (Table 3).

We compare the genus distributions for the following three pairs of successive
even and odd lengths: (i) L = (10, 11) (Figures 3(a), 3(b) respectively), (ii)
L = (20, 21) (Figures 3(c), 3(d) respectively) and (iii) L = (30, 31) (Figures
3(e), 3(f) respectively), for different α values (Figure 3). For a chosen
length, plots for α = 0, 0.75, 1 are compared. In each odd-length figure,
compared with the corresponding even-length figure, the α = 1 curve is absent.
This is because the partition function vanishes for odd lengths at α = 1. It is
also observed that the curve corresponding to α = 0.75 at L = 11 comprises
points which are an average of the points on the α = 0.75 and α = 1 curves for
L = 10 (points of the same genus for the two lengths are considered). For
example, in Figure 3(a) (for L = 10), the points corresponding to g = 2 on the
α = 0.75 and α = 1 curves are averaged, and the
$a_{L,g,\alpha}/\mathcal{N}_\alpha$ value thus obtained is similar to the
$a_{L,g,\alpha}/\mathcal{N}_\alpha$ value for g = 2 on the α = 0.75 curve in
Figure 3(b) (for L = 11). The same is observed on comparison of the other
lengths (L = (20, 21) and L = (30, 31)). The shape of the α = 0.75 curve in the
odd-length figure (L = 11) seems to take the shape of the α = 1 curve in the
even-length figure (L = 10). This is also observed in the plots of
$a_{L,g,\alpha}/\mathcal{N}_\alpha$ versus L for different genera
(g = 0, 1, 2, 3) when α = 0, 0.75, 1 are plotted together (Figure 4). Notice
that in the curve for α = 0.75 (Figure 4), the points for each even-odd pair of
lengths lie close together, separated by large distances from the neighboring
even-odd pairs.

Fig. 3. Genus distributions for (a) L = 10, (b) L = 11, (c) L = 20, (d) L = 21,
(e) L = 30 and (f) L = 31 with α (= 0, 0.75, 1) values are shown in these
figures. For odd lengths (L = 11, 21, 31), the α = 1 distribution is absent. The
axis label in the figures should be understood as
$a_{L,g,\alpha}/\mathcal{N}_\alpha$.

This analysis brings out the otherwise not so visible differences in (i) the
total number of structures and (ii) the shape of the genus distributions for
different α, as compared to the distributions in the RNA-MM [1]. A study of the
genus distributions approaching α = 1 is particularly important, as dramatic
changes take place, as shown in Figures 3 and 4.

Fig. 4. Genus distributions for (a) g = 0, (b) g = 1, (c) g = 2 and (d) g = 3
with α (= 0, 0.75, 1) values are shown in these figures. The axis label in the
figures should be understood as $a_{L,g,\alpha}/\mathcal{N}_\alpha$.



5 Conclusions 



We have given a detailed analytic calculation of the partition function after
introducing a linear perturbation in the action of the nucleotide chain for the
matrix model of RNA folding (section 2). A parameter α (= v/w) is found
explicitly in the extended matrix model (RNA-EMM). The parameter α can be tuned
to different values to give several combinations of the strength ratio of the
original and the perturbed term, thus giving various extensions of the model.
For instance, for α = 0, the extended matrix model (RNA-EMM) reduces to the
RNA-MM in [1]. We find that for α = 1 the partition function for odd lengths of
the polymer chain vanishes completely. This brings significant changes in the
genus distributions, in terms of the number of structures and the shape of the
distribution, as compared with the RNA-MM in [1]. We have compared the genus
distributions at different lengths and genus for α = 0, 0.75, 1. The curve
corresponding to α = 0.75 at odd lengths comprises points which are an average
of the points on the α = 0.75 and α = 1 curves on the preceding plot of even
length (the same genus for the two lengths is considered for comparison). Also,
the shape of the α = 0.75 curve for the odd lengths seems to take the shape of
the α = 1 curve in the plot of even length (section 4). For other values of α we
have different results for the partition function (listed in Table 1). The
general form of $Z_{L,\alpha}(N)$ for the extended matrix model consists of an
additional multiplicative factor of a power of (1 − α) with each term, as
compared to [1]. The maximum value of the genus g for the extended matrix model
for even L and all α is found to be L/4, i.e. $g_{max} \le L/4$ (found
numerically). For odd lengths and α ≠ 1 this result is also true, and for α = 1
there are no diagrams.
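The bound on the genus can be checked for short chains directly from the generating function. The sketch below is an illustration, not the authors' numerical procedure. It assumes the generating function $\exp(vt^2/2N + t(1-\alpha))\,\frac{1}{N}\sum_k\binom{N}{k+1}(vt^2)^k/(k!\,N^k)$, expands the $v^n t^{2n}$ term as a polynomial in x = 1/N, and reads the largest genus from the highest surviving power of x. The names `genus_polynomial` and `max_genus` are hypothetical.

```python
from fractions import Fraction
from math import factorial

def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest power first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def genus_polynomial(n):
    """Coefficients in x = 1/N of the v^n t^(2n) term of G at alpha = 1."""
    total = [Fraction(0)] * (n + 1)
    for k in range(n + 1):
        # C(N, k+1) / (N^(k+1) k!) = prod_{j=1..k} (1 - j x) / (k! (k+1)!)
        term = [Fraction(1, factorial(k) * factorial(k + 1))]
        for j in range(1, k + 1):
            term = poly_mul(term, [Fraction(1), Fraction(-j)])
        m = n - k                                  # order taken from exp(v t^2 x / 2)
        term = poly_mul(term, [Fraction(0)] * m + [Fraction(1, 2 ** m * factorial(m))])
        total = [a + b for a, b in zip(total, term)]
    return total

def max_genus(L, alpha_is_one=False):
    """Largest genus among diagrams of length L, or None if there are no diagrams."""
    if alpha_is_one and L % 2:
        return None
    arc_numbers = [L // 2] if alpha_is_one else range(L // 2 + 1)
    best = 0
    for n in arc_numbers:
        top = max(p for p, c in enumerate(genus_polynomial(n)) if c != 0)
        best = max(best, top // 2)                 # a power x^(2g) means genus g
    return best

for L in range(1, 13):
    print(L, max_genus(L), max_genus(L, alpha_is_one=True))
```

In this check the largest genus equals the integer part of L/4 whenever diagrams exist, which is consistent with the stated bound: a diagram with n arcs has genus at most n/2, and n cannot exceed L/2.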

The diagrammatic representation of the extended matrix model is discussed in
section 3. This follows from the general form of $Z_{L,\alpha}(N)$ for the
extended matrix model found in section 2. The Feynman diagrams corresponding to
length L = 4 of the polymer chain are shown in Figure 1. Each unpaired base in
the polymer chain is associated with a factor of (1 − α). Hence for odd lengths
of the polymer chain, where at least one base is always unpaired, the partition
function vanishes when α = 1. $Z_{L,\alpha=1}(N) = 0$ for odd lengths implies
that no unpaired base is allowed in the α = 1 phase. For other values of α, the
unpaired bases in the chain are weighted by a factor of (1 − α). We define a
"structural transition" where structures change from an "unpaired-paired base
phase" to a "completely paired base phase" at α = 1 (section 3). The
perturbation has thus created a phase where RNA structures with a limited
biological activity (as only structures with paired bases are possible) are
separated out from the otherwise possible vast number of structures (where
structures with both paired and unpaired bases are possible).

Our future work will concentrate on a study of more general and realistic
perturbations and an analysis of their genus distributions. Another interesting
aspect of the problem is to take into consideration Watson-Crick and wobble
(G-U) pairings in the interaction ($V_{ij}$) between the nucleotides, so that
the assumption that a base can pair with any other base (made in the models
discussed so far) is relaxed. This would be a more realistic approach to the
problem of RNA folding, but it is not a trivial problem. We would like to study
these generalized matrix models with more mathematical rigour, as the addition
of a linear term in the action can produce non-trivial results, as shown in
[11][12] for similar matrix models. The analysis of asymptotics for the extended
matrix model (RNA-EMM) will also be dealt with in future work.